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ABSTRACT 


For  periodic  review  inventory  models  with  stochastic 
demand,  the-  idea  of  stock-out  risk  is  defined,  from  which 
the  importance  of  accurate  prediction  of  demand  is  deduced. 
Methods  of  demand  model  parameter  estimation  are  investigated 
and  several  methods  compared  on  the  basis  of  theoretical 
soundness,  ease  of  application,  and  accuracy  of  estimates 
based  upon  the  results  of  extensive  computer  simulation.  The 
theoretical  development  of  maximum  likelihood  and  exponential 
smoothing  estimators  as  applied  to  prediction  is  presented 
along  with  the  development  of  a  new  Bayesian  approach  to  the 
problem  of  demand  forecasting.  ) 
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I .  INTRODUCTI ON 


There  is  a  great  deal  of  interest  today  in  business  and 
industry,  as  well  as  government  activities,  in  inventory  con¬ 
trol.  The  current  literature  of  operations  research  contains 
considerable  ideas,  methods,  mathematical  models  and  algo¬ 
rithms  addressing  this  particular  subject  area.  One  of  the 
most  difficult  problems  involved  in  the  application  of  any 
mathematical  model  to  actual  inventory  analysis  or  control 
problems  is  the  estimation  of  the  parameters  used  in  the  for¬ 
mulas  of  the  models  and  algorithms.  To  resolve  this  problem 
operations  researchers  have  concerned  themselves  with  the  de¬ 
velopment  of  accurate  estimation  techniques  and  demand  fore¬ 
casting  methods.  One  of  the  techniques  which  has  received  a 
good  deal  of  attention  in  the  current  literature  is  exponen¬ 
tial  smoothing,  a  method  of  estimation  and  forecasting 
developed  by  Brown  [Ref.  2].  The  basic  idea  of  the  smoothing 
approach  is  to  use  a  weighted  moving  average  of  demand  data  in 
such  a  way  as  to  exponentially  discount  the  oldest  data  in  the 
demand  sequence  and  place  most  of  the  weight  for  an  estimate 
on  the  most  recent  data  available.  This  exponential  discount¬ 
ing  of  past  demands  is  controlled  by  the  selection  of  a  smooth¬ 
ing  constant  which  determines  the  relative  value  of  the  weight 
assigned  to  the  data  in  the  estimate.  The  exponential  smoothing 
technique  has  been  questioned  on  theoretical  grounds  by  Zehna 
[Ref.  10]  and  others.  It  has  been  suggested  that  the  classical 
maximum  likelihood  methods  will  produce  better  results  in  a 
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number  of  demand  models  and  some  simulation  results  are  avail¬ 
able  which  indicate  that  this  is  the  case  [Refs.  6  and  11]. 

The  purpose  of  this  thesis  is  to  examine  the  problem  of 
forecasting  demand  and  explore  several  alternative  methods 
which  should  be  considered  as  possible  forecasting  techniques 
for  particular  demand  models.  A  simulation  analysis  of  max¬ 
imum  likelihood,  smoothing,  and  Bayesian  estimation  methods 
was  conducted  in  order  to  ascertain  important  or  noteworthy 
properties  of  the  techniques  when  applied  to  normally  distrib¬ 
uted  random  demand  in  the  constant  mean  and  linear  demand 
models . 

Throughout  this  report  the  idea  of  a  risk  involved  in  in¬ 
ventory  operations  plays  a  central  role.  The  demand  models 
studied  are  assumed  to  be  applied  to  an  inventory  system  in 
which  stock  is  controlled  by  means  of  a  periodic  review  of 
inventory  levels.  That  is,  at  certain  specified  periods  of 
time,  monthly,  quarterly,  etc.,  the  demand  for  items  is  deter¬ 
mined  along  with  the  current  inventory  balance.  These  data 
are  used  to  requisition  replacement  stock  to  insure  that  the 
system  does  not  run  out  of  stock  too  often.  If  the  demand  were 
deterministic  there  would  be  no  problem  in  deciding  how  much 
to  order.  However,  the  demand  in  this  case  is  assumed  random 
and  predictions  of  the  demand  in  future  periods  must  be  made 
in  order  to  set  reorder  levels.  It  is  assumed  that  resupply 
is  instantaneous  so  that,  if  it  can  be  determined  at  the  end 
of  a  period  that  within  a  specified  probability  the  demand  in 
the  next  period  will  be  less  than  a  certain  quantity,  then  the 


( 

reorder  level  is  that  expected  upper  limit  on  the  next 
period's  demand.  The  reorder  quantity  is  the  reorder  level 
less  the  amount  of  stock  on  hand.  3y  deciding  what  probabil¬ 
ity  of  running  out  of  stock,  called  stock-out  risk,  is 
acceptable,  the  decision  maker  sets  a  risk  level  that  deter¬ 
mines  the  ordering  policy.  Consequently  estimating  parame¬ 
ters  and  predicting  demand  in  general  is  a  central  issue  in 
the  determination  of  reasonable  reorder  policies. 

In  Section  II.  the  theoretical  development  of  the  maximum 
likelihood  estimation  technique  is  presented  along  with' a 
simplified  version  which  was  used  by  Zehna  [Ref.  11]  for 
comparison  with  exponential  smoothing.  Section  III.  is  a 
summary  of  the  theoretical  basis  for  exponential  smoothing 
developed  by  Brown  [Ref.  2].  In  addition,  at  the  end  of  Sec¬ 
tion  III  smooth, ing  is  applied  to  a  case  not  previously  con¬ 
sidered  by  Brown.  Section  IV  presents  a  comparison  of  maxi¬ 
mum  likelihood  and  smoothing  with  consideration  given  to 
theoretical  basis,  ease  of  application,  and  simulation  re¬ 
sults.  In  Section  V  a  Bayesian  procedure  for  demand  estima¬ 
tion  and  prediction  is  developed  for  the  case  when  the 
variance  of  the  demand  distribution  is  known.  The  resulting 
Bayes  estimation  method  is  compared  to  exponential  smoothing 
on  the  basis  of  computer  simulation. 

Since  simulation  was  used  throughout  this  study  a  special 
section  on  this  technique  is  presented  at  the  beginning  of 
Section  IV.  It  should  be  noted  that  in  previous  cases 
studied  by  Zehna  [Ref.  11]  and  Ornek  [Ref.  6],  simulation  was 
also  used  as  one  method  of  comparing  MLE  and  smoothing.  There 
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is  a 'basic  difference  in  the  approach  in  those  papers  and 
the  approach  taken  in  this  study.  Their  procedure  was  to 
generate  random  demand  over  a  long  period  of  time,  1000 
periods,  making  estimates  with  the  various  methods  at  each 
period,  and  determining  the  sample  risk  (probability  of 
running  out  of  stock)  over  the  entire  time  of  operation. 

The  approach  taken  here,  however,  realizing  that  in  many  ap¬ 
plications  only  a  limited  amount  of  data  may  be  available  or 
used  as  a  basis  for  estimation,  is  to  generate  demand  data, 
make  estimates,  and  determine  the  risks  involved  at  various 
points  in  time,  say  5  periods,  20  periods,  etc.,  over  a  large 
number  of  replications  of  the  experiment.  It  is  felt  that 
the  results  of  this  simulation  method  should  prove  useful  in 
determining  which  estimating  techniques  perform  well  when 
estimates  are  based  upon  a  fairly  limited  amount  of  data. 
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II.  MAXIMUM  LIKELIHOOD  ESTIMATION 


A  classical  approach  to  forecasting  or  predicting  the 
value  of  a  random  variable  based  upon  past  observations  is 
to  develop  a  functional  relationship  between  the  observed 
values  of  the  random  variable  and  some  control  variable  us¬ 
ing  regression  analysis.  Applied  to  the  present  circumstances 
one  may  view  the  random  variable  Y  as  demand  and  the  control 
variable  x  as  time.  Demand  is  observed  at  various  times 
xl'  x2'  **"'  xn  an<^  then  inferences  are  made  about  the  de¬ 
pendence  of  Y  on  x. 

In  this  section  the  results  of  the  General  Linear  Hypotn- 
esis  are  examined  as  they  apply  to  forecasting  demand  when 
the  mean  demand  is  a  linear  function  of  time,  that  is, 

E(Yi)  =  a  +  bxi  . 


It  can  be  shown,  see  Zehna  {Ref.  12],  that  the  maximum  like- 

2 

lihood  estimates  of  0  and  a  are: 
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^  — 1  *2 

3  =  Q  A'  Y  and  0  = 


(Y  -  A  P  )  (Y-  AS) 


n 


respectively,  where  Q  =  A'  A.  It  is  further  true  that  6_  , 


2^-l\ 


the  random  vector  corresponding  to  3_  is  N2  (3_,  0  Q  )  where 
N?  indicates  a  bivariate  normal  distribution.  Algebraically 
then , 


Q  = 


n  nx 


-  r  2 

nx  I  x. 

i=l 


,  and  Q 


-1 


ns. 


n 


l  xf  _ 

i=l  1 


nx 


-nx  n 


n  n 

where  s^  =  1/n  \  (x.  -  x)  and  x  -  1/n  \  x.  • 

x  i=l  1  i=l 


Then 


|  =  Q_1A« Y  =  1/ns^ 


11  2  —  n 
y  J/-i  * x  J.Vi 

1=1  i=l 


n 


l  (xi  -  x)  (yi  -  y) 
i=l 


The  above  reduces  to 

n 


b  - 


)  x.y.  -  nxy 
i=l _ 

"  2  -2 
I  x 1  “  nx 
i=l 


(2-1) 


a  =  y  -  xb 


(2-2) 


2 .  Variance  of  the  Estimators 


Since  |  is  N2(g_,  a2 Q_1)  it  follows  that: 
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These  quantities  depend  only  upon  the  total  number  of  periods 
of  demand  observed  and  the  variance  of  the  underlying  distri¬ 
bution  in  the  model.  The  relative  sizes  of  these  variances 
are  indicated  in  the  following  table. 


TABLE  I 

*  » 
VARIANCE  OF  a  and  b 


n 

ft  2  " 

V (a) /a 

V (£) /a2 

ft  £  o 

COV (a ,  b )/a 

10 

0.4667 

0.0121 

-  0.0667 

50 

0.0825 

0.0001 

-  0.0025 

100 

0.0406 

0.0000 

-  0.0006 

1000 

0.0040 

0.0000 

-  o;oooi 

.  2  . 
The  maximum  likelihood  estimate  of  a  is 


a2  =  1/n  l  (yi  -  -  &2*i)2  * 

i=l 

It  is  well  known  that 
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c* 

so  that  — j  is  an  unbiased  estimator  for  o  . 


3 .  Prediction 

The  main  purpose  of  using  estimation  in  inventory 
applications  is  to  forecast  demand  at  some  future  period  of 
time.  Most  often  it  is  necessary  to  observe  demand  in  peri¬ 
ods  1,  2,  n,  and  utilizing  these  data  predict  what  de¬ 

mand  will  be  in  the  next  period  in  order  to  set  a  reorder 
level  such  that  within  tolerable  risk  limits  the  system  will 
not  run  out  of  stock. 


The  demand 


y  at  time  x  4  ;c. ,  •  •  • ,  x  .  may 
2  o  o  '  1 '  n '  2 


considered  as  a  value  of  the  random  variable  Y  which  is 

o 

2 

distributed  N(a  +  bx  ,  a  ) .  Then 


Yo  =  Nn+1(AQ  S,  a2I> 


where  A  = 
— o 


.  It  can  be  snown  that 


£  £  /  9  (x  -  x  ) 

Yo  -  a  -  b*0  -v  N  O,  a  1  +  1/n  +  ■  . - 

V  L  l  (*i  '  x)2 

\  i=l  1 


and  is  independent  of  no  .  It  follows  that 


A  A 

A  A 

Y  -  a  -  bx 
o  o 


o  1  +  1/n  + 


(x  -  xor 

n  -  2 
Jl(Xi-X) 


a  (n  -  2) 


has  a  t-distribution  with  n-2  degrees  of  freedom.  Thus  if 
a  reorder  level  R  is  set  by  means  of 


A  A  A 


(X  -  X  ) 

R  =  a  +  bx  +  t  o  ,  „  S  /  1  +  1/n  +  - - -  (2-4) 

o  n-2, 1-a  y*x/  '  n  _  „  ' 

l  (x.  -  X) 

i=l 


where  a  and  b  are  as  defined  previously. 


and  t  „  ,  is  the 
n-2 , 1-a 


(l-a)th  percentile  of  the  t-distribution  with  n-2  degrees 
of  freedom,  the  probability  that  the  demand  in  period  xq  will 
be  less  than  or  equal  to  R  is  1-a.  This  assumes  that  a  risk 
of  (lOO  aj%  is  acceptable. 

It  should  be  noted  that  the  length  of  this  one-sided  uppe 
prediction  limit  for  Yq  increases  as  the  period  for  which  the 
forecast  is  made  differs  in  time  from  x.  This  fact  should 
discourage  "Extrapolation"  too  far  above  the  midpoint  of  the 
sampling  interval.  However,  the  forecast  should  be  good  for 
the  (n+l)st  period. 


B.  CONSTANT  MEAN  MODEL 

The  constant  mean  model  may  be  considered  as  a  special 
case  of  the  linear  model  above  with  b  =  0.  For  this  model, 

Yi  =  a  +  ei  w^ere  ei  ^  i=l, 2,***  ,n 

1.  Estimates  of  the  Coefficient  and  Variance 

Utilizing  the  same  procedure  as  before  it  is  found 


that 


a  =  1/n  l  y.  =  y  , 
i=l  1 


(2  -5) 


E  (a)  =  a  ,  V(a)  =  a  /n 


(2-6) 


°  =  1/n  l  (y.  -  y) 

i=l  1 


2.  Prediction 


Again  it  can  bo  shown  that  Y  -  a  has  a 
J  o 

A 

2  ^  2 
N(0,  a  [1  f  1/n])  distribution  and  is  independent  of  no 

n*2  2 

-~r  has  av  distribution  so  thct 
a2  Xn"2 

A 

A 

Y  -  a 
o 


o  i/  1  +  1/n 


is  distributed  as  t 


n-1  * 


o  (n-1) 


It  follows  that  setting  a  reorder  level  by  means  of 


R  =  a  +  t  ,  S  f  1  +  1/n 

n-1,  1-a  y 


(2-7) 


where  S  =  /  l  (y.  -  a) 2 

l/^W- 

will  yield  the  desired  risk  a  . 


C.  LINEAR  MODEL  WITH  Y-INTERCEPT  =  0 

In  many  cases  it  might  be  reasonable  to  assume  that  there 
is  no  minimum  initial  demand  imposed  on  the  system,  i.e.,  at 
time  x  =  0,  Y  =  0,  and  that  demand  grows  linearly  with  time. 
This  may  be  expressed  as 

2 

=  bx^  +  where  'v  N(0,  o  )  i=l,2, • • • ,n. 


-W'  ,*«■ 

r  •  : 

;  • 
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In  matrix  notation 


Y  =  Ab  +  e  where  A  = 


x. 


X, 


X 


n 


1.  Estimate  of  Coefficient  and  Variance 


Pioceeding  as  before, 
n 


Q  =  A'A  =  I  xf  Cf 1  -  1/  l 

i=l  1  i=l  1 


n 


b  =  Q  1 Y 


n 

I 

i=l  1  1 
n  o 
l  x2 
i=l  1 


(2-8) 


0  =  (Y  -  Ab)  '  (Y  -  Ab)  =  1/n  l  (y . 

i=l 


bx. ) 
i 


V(b)  =  o2Q  1 


n  9 
l  x2 

.  L ,  l 
1=1 


E(b)  =  b 


(2-9) 


2.  Prediction 


As  in  the  previous  case  it  can  be  shown  that 

2 

S  2  X 

Y  -  bx  has  a  N(0,  0  [1  +  — - - ])  distribution, 

o  o  n  ^ 

I  X? 

i=l  1 


no.  2 
— is  x 


n-1  ' 


A  A  2 

and  further  that  Y  -  bx  and  n0  are  independent. 

o  o 
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Thus 


Y  -  bx 
o  o 


^  t  , 
n-1 

ct“  (n-1) 

The  appropriate  reorder  level  to  obtain  a  risk  a  is 


R  =  bx  +  t  ,  ,  S 

o  n-1,  1-a  y*x 


x 


i  t- 


o 


v  2 
>  x- 

i=l  1 


n 


where  S 


y  *x 


.1 ,<*i  -  bxi> 
i=l 

n-1 


(2-10) 


and  t  ,  ,  is  the  (l-a)th  precentile  of  the  t-distribution 

n-1,  1-a 

with  n-1  degrees  of  freedom. 


D.  CONSTANT  MEAN-SIMPLIFIED  PROCEDURE 

A  more  easily  applied  technique  for  setting  the  reorder 

level  in  the  constant  mean  case  was  demonstrated  by  Zehna 

[Ref.  11]  and  Ornek  [Ref.  6]  with  very  good  results.  The 

procedure  was  used  only  as  an  attempt  to  compare  the  accuracy 

of  maximum  likelihood  estimation  with  that  of  estimating 

parameters  using  exponential  smoothing. 

Since  demand  is  normally  distributed  with  mean  y  and 
o 

variance  a  there  is,  of  course,  a  theoretical  reorder  level 
y  +  Ko  such  that 

Pr  (Y  <  y  +  Ka)  =  1  -  a  (2-11) 
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( 

where  K  is ' the  (1  -  a) th  percentile  of  the  standard  normal 
random  variable  —  N(0,  1)  . 

Hence  maximum  likelihood  method 3  may  be  used  to  estimate 
p  and  a  separately  and  then  these  estimates  combined  as  in 
Eq.  (2-11)  to  set  the  reorder  level.  In  this  way, 


and 

R  =  p  +  Kc  .  (2-12) 

This  method  of  course  fails  to  consider  the  actual  distri¬ 
butions  of  p  and  a  so  that  the  true  risk  so  obtained  is  prob¬ 
ably  not  exactly  a,  but  for  many  purposes  the  ease  of 
application  may  make  the  tradeoff  worthwhile. 

It  is  possible  to  make  statements  about  the  expected 
value  and  variance  of  this  estimate  of  reorder  level  as  the 
distributions  of  p  and  a  are  well  known,  and  the  correspond¬ 
ing  random  variables  are  further  known  to  be  independent. 


A  A  ^ 

From  Eq.  (2-6) ,  E(p)  =  p  and  V(p)  -  a  /n  .  It  is  shown 
in  Hald  [Ref.  5]  that  a  has  a  Chi-distribution  and  in 
particular 


( 


n  -  1  ,  2  2  ,  ,2  1 

V(a)  = - b  o  where  b  -  1  - 

n  n  n 


£  .  1  2 
V(a)  =  —  •  a 

2n  n->°° 


0 


;Y 


Thus  E(R)  =  y  +  K  y  1  - 

£  K2  +2  2 

and  V  (R)  =  -  cT 


2n 


2n  n-+°° 
>  0  . 


■>  y 


2  =  1 
n  2  (n  -  1) 


+  Ka  , 
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III.  EXPONENTIAL  SMOOTHING 


As  indicated  in  the  introduction,  exponential  smoothing 
is  a  method  of  forecasting  discrete  time  series,  in  this 
case  future  demand,  based  upon  past  observations  of  the  sys¬ 
tem.  The  major  source  of  information  on  the  development  and 
applicatio i  of  exponential  smoothing  is  the  book  by  Robert 
G.  Brown  [Ref.  2] .  In  fact,  Brown  introduced  the  concept  and 
the  very  name  exponential  smoothing  in  an  earlier  publication. 
Unless  otherwise  indicated  all  of  the  following  discussion  of 
the  methodology  and  theory  may  be  found  in  the  reference 
noted  above . 


A.  EXPONENTIAL  SMOOTHING  OF  PAST  OBSERVATIONS 

The  basic  idea  behind  exponential  smoothing  is  to  assign 
weights  to  past  observations  so  as  to  place  the  most  weight 
upon  recent  observations  and  less  and  less  weight  on  the 
older  pieces  of  demand  data,  thereby  limiting  the  effect  on 
the  estimate  of  "dated"  information.  This  may  be  accomplished 
by  selecting  a  smoothing  constant  0  <  a  <  1  which  is  exactly 
the  weight  placed  on  the  current  observation.  The  smoothing 
operator  is  given  by 
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where  3  =  1  -  a  and  y^,  ,  *••,  yt  arc  the  observations  of 

demand  in  the  first  t  periods,  and  y  is  some  initial  demand 

often  taken  to  be  zero.  For  application  in  this  thesis  it 

will  be  assumed  that  y  =0. 

J  o 

Y  S^Cf)  then  is  the  exponentially  smoothed  estimate  of 
demand  in  the  (t  +  l)st  period  based  upon  t  periods  of 
observation . 

Brown  claims  that  exponential  smoothing  may  be  utilized 
effectively  to  accurately  estimate  the  coefficients  in  vari¬ 
ous  models  of  discrete  time  series  such  as  demand  or  sales 
forecasts.  The  idea  certainly  has  intuitive  appeal  and  seems 
to  be  widely  used  as  a  forecasting  technique. 

For  applications  of  exponentia  L  smoothing  the  number  of 
coefficients  to  be  estimated  in  a  model  is  referred  to  as  the 
degrees  of  freedom  of  the  model.  Corresponding  to  each  degree 
of  freedom  is  an  order  of  exponential  smoothing  defined  by 
applying  the  smoothing  operator  recursively.  For  single 
smoothing,  as  first  order  smoothing  is  called,  Eq.  (3-1)  iruiy 
be  written, 

St(Y)  =  ayt  +  3St_1(Y)  ‘  (3-2) 

where  St-1(Y)  is  the  current  smoothed  estimate  prior  to 
observing  y^.  Nth  order  smoothing  may  then  be  defined  as 

S"(Y)  =  S[S1J";L(Y)]  =  aSj*1(Y)  +  3S1J_1(Y) 

where  S°(Y)  is  defined  to  be  Sfc(Y) .  As  a  special  case, 
double  smoothing  becomes 

S*(Y)  =  <xSt(Y)  +  3S^_1(Y)  (3-3) 
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B.  CONSTANT  MEAN  MODEL  -  SINGLE  SMOOTHING 

Assuming  that  demand  Y  is  distributed  normally  with  mean 
2 

y  and  variance  o  then 


Yi  =  y  +  where  ^  N(0,  a2)  i=l,2,  ***,t  . 

Since  there  is  but  one  coefficient  to  be  estimated,  single 
smoothing  is  called  for  to  estimate  y.  After  t  periods  have 
been  observed  the  exponentially  smoothed  estimate  of  the  mean 
of  the  demand  distribution  is  given  by 


y  =  Sfc(Y) 


aYt  +  3St_l(Y) 


t-1 

a  l 


i=0 


t-i  * 


(3-3) 


t-1  .  K 

Note  that  FJ(ju)  =  «E(Y)  £  31  =  )j(1  -  3  ),  where  y  is  the  ran- 

i=0 

lim  ~ 

dom  variable  associated  with  u,  so  that,  .  1  E(y)  =  y.  Thus, 

r  r->co 

y  is  asymptotically  unbiased.  Also, 

_  2  2  tv1n2i  _  2  2  rl  -  B2t 

V(y)  -  aa  £8  -ao  l- — = — L 

i=0 


1  +  3 


(3-4) 


li.ni  —  2  ry  . 

and  V  ( y)  =  a  ( ^-— ^— ■)  ,  a  non-zero  quantity. 

In  attempting  to  estimate  the  underlying  variance  of  the 
demand  distribution  (not  considered  a  coefficient  in  the 
model),  Brown  chooses  to  use  mean  absolute  deviation,  MAD  for 
short,  a  measure  of  variability  which  has  questionable  theo¬ 
retical  basis  and  properties  [Ref.  10]  .  MAD  is  defined  as 

A  =  E(|Y  -  yj)  .  (3-5) 


It  is  well  known  that  when  Y  is  normally  distributed  the 
ratio  of  A  to  a  is  ,  about  0.79  8.  This  relationship  does 

not  hold  however  in  several  other  important  probability 
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distributions .  To  estimate  a  Brown  first  estimates  A  by 
smoothing  forecast  errors  defined  as 

e(t)  =  yt  -  St_x(Y)  ,  (3-6) 

the  forecar. t  error  in  period  t.  The  smoothed  estimate  for 
A  is  given  by 

or  in  operator  form 

At  =  a |  e t|  +  3  Afc_1  .  .(3-7; 

Then  the  smoothed  estimate  of  a  is  given  by 


implicitly  assuming  the  smoothed  estimate  for  A  enjoys  the 
same  invariance  principle  known  to  be  true  for  maximum  like¬ 
lihood  estimates. 

It  has  been  clearly  pointed  out  by  Zehna  [Ref.  11]  and 
others  that  such  a  procedure  as  that  described  above  is  sta¬ 
tistically  suspect.  Several  attempts  have  been  made  to  de¬ 
velop  the  distribution  theory  necessary  to  make  theoretical 
judgments  concerning  A  and  the  invariance  principle  implicit 
in  obtaining  a  but  so  far  the  mathematics  has  proven 
intractible  [Ref.  6] . 

Disregarding  the  statistical  questions  involved.  Brown 
combines  the  smoothed  estimates  of  y  and  o  as  in  Eq.  (2-12) 
to  obtain  a  smoothed  estimate  of  reorder  level.  In  this  way, 

R  =  y  +  Ka 
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(3-9) 


where  K  is  the  appropriate  percentiLe  of  the  N(0,  1) 
distribution. 

The  initial  condition  Sq(Y)  is  either  zero  or  some  best 
guess  as  to  what  the  value  of  p  will  be  prior  to  making  any 
observations.  The  effect  of  this  initial  value  is  negli¬ 
gible  after  30-45  periods  for  .  1  <_  *  <.  .9 

Some  remarks  may  be  made  concerning  the  expected  value 
and  variance  of  R,  the  random  variable  corresponding  to  II, 

£ (a)  -Y JY'2?  E(S) 
t-1  i 

E(A)  =  a  l  3  E(|  e  .  |  )  . 
i=o 

E(|  et_^  |)  is  shown  in  [Ref.  11]  to  be  a 
so  that 

E(o)  «  (1  -  3t)a  0  '  (3-10) 

indicating  that  a  is  asymptotically  unbiased  if  all  the 
underlying  assumptions  are  valid. 

As  stated  earlier,  no  analytical  results  have  been  ob¬ 
tained  concerning  the  distribution  of  A  and  the  problems 
involved  extend  into  the  intractibility  of  obtaining  the  exact 
variance  of  c.  However,  Ornek  [Ref.  6)  was  able  to  develop  a 

method  for  approximating  V(a)  numerically.  v(a)  is  dependent 
o 

upon  both  a  and  the  value  of  the  smoothing  constant  a.  That 
is, 

V(d)  =  C2a2  (3-11) 


where  C  may  be  found  in  the  following  table. 
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TABLE  II 

a  VARI/^NCE  FACTORS 


a 

C 

t 

0.10 

0.1745 

t 

> 

80 

0.15 

0.2173 

t 

> 

50 

0.20 

0.2553 

t 

> 

35 

0.25 

0.2905 

t 

> 

30 

0 . 30 

0 . 3239 

t 

> 

25 

0 . 35 

0.3561 

t 

> 

20 

0.40 

0.3876 

t 

> 

.15, 

The  values;  of  C  were  calculated  by  Ornek  [Ref.  6]  and  the 
number  of  periods  necessary  for  a  good  approximation  were 
determined  in  the  course  of.  computer  simulation  and  testing 
of  the  model  . 

Using  the  results  of  Eqs .  (3-3)  -  (3-11)  the  expected 
value  of  R  may  be  written 

E(R)  =  (1  -  S1)  [y  +  K a)  ^— >  vi  +  Ka  , 
and  the  variance  of  R  may  be  approximated  as 

V (R)  =  [(I-^T)(1  -  $2t)  +  C2K2]a2  (3-12) 

assuming  that  y  and  a  are  independent.  Again,  the  variance 
of  R  is  non-zero  even  for  large  t. 
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C.  LINEAR  MODEL  -  DOUBLE  SMOOTHING 

If  the  demand  in  period  t  is  of  the  form 

2 

=  a  +  bt  +  et  where  e  %  N(0,  a  ) 


then  E(Y^)  =  a  +  bi  and  V(Y^)  =  a ^  , 
1.  Estimate  of  the  Coefficients 


i— 1 , 2  ,  • • •  ,  t 


At  time  t,  the  coefficients  are  estimated  by  a  com¬ 
bination  of  both  single  and  double  smoothing  and  the  demand 
at  time  t  +  x  is  estimated  as 

it+T  =  St  +  V 

where  x  is  the  number  of  periods  in  the  lead  time.  The  esti¬ 
mates  of  the  coefficients  developed  by  Brown  are 


at  =  2St(Y)  -  S*(Y) 
bt  =  a/P  [St(Y)  -  S*(Y)  ] 


(3-13) 


where  St(Y)  andS^(Y)  are  given  by  Eqs .  (3-2)  and  (3-3).  It 
is  important  to  note  that  a^  is  not  an  estimate  of  the  para¬ 
meter  a  in  the  model  but  rather  an  estimate  of  the  current 
point  on  the  theoretical  line  at  time  t.  Thus  Yt+^  = 

+  btx  is  equal  to  the  estimate  of  the  current  theoretical 
demand  increased  by  the  estimated  slope  multiplied  by  the 
number  of  periods  in  the  lead  time.  Usually  x  will  be  equal 
to  one  and  the  forecast  will  be  for  the  next  period  only. 

2.  Initial  Conditions 

The  initial  conditions  for  the  linear  model  are 
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TABLE  III 

VARIANCE  OF  SMOOTHING  COEFFICIENTS 


a 

A 

B 

C 

0.10 

.12611 

.00029 

.00539 

0.15 

.18982 

.00107 

.01262 

0.20 

.25377 

.00274 

.02332 

i  0.25 

.31778 

.00582 

.03790 

0.30 

.38164 

.01099 

.05679 

1  0.35 

.44508 

.01909 

.08045 

i 

1 

i 

_oj 

.50781 

.03125 

.10  9  38 

absolute  differences  between  the  estimated  point  on  the  line, 
Pt,  and  the  actual  demand  observed  in  period  t. 

D.  LINEAR  MODEL  -  Y  INTERCEPT  =  0 

When  the  Y  -  intercept  is  known  to  be  zero  the  model  is 
given  by 

2 

Yf  =  bx^  +  where  'v  N(0,  a  ),  i=l,2,**»,t  . 

Brown  does  not  consider  the  model  as  a  special  case  so  no 
smoothed  estimate  for  b  is  given  in  [Ref.  2] .  Following  the 
general  smoothing  methodology,  it  is  assumed  that  only  single 
smoothing  is  necessary  to  estimate  the  single  coefficient  in 
the  model.  At  successive  periods  of  time  x^,  x^,  •••# 
the  corresponding  demand  y^,  ,  • • • ,  Yt  is  observed.  A 

natural  point  estimate  of  the  slope  at  time  x^  would  be 

Bk  =  yk/xk 
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So(y)  =  ao  -  (3/a)b0 


Sq ( Y)  =  ao  -  2(3/a)bo 


(3-14) 


where  and  b  are  initial  guesses  as  to  the  values  of  the 
coeff icients  a  and  b.  Again  these  initial  conditions  are 
transient  and  have  no  effect  on  the  estimate  after  30-45 
periods . 

3 .  Variance  of  the  Esti mators 

Using  multivariate  analysis  Brcwn  claims  the  follow¬ 
ing  asymptotic  formulas  for  the  variances  of  a  and  6  in  the 
linear  casa. 


V(I.)  .  *1L±J±.±SLL  C2  =  AO2 

(1  +  3)  J 


V  (6  )  =  — ^ - T 

t  (1  +  3)J 


O2  =  BO^ 


COV(a  ,  t)  -  SLSL±™  a2  =  Co2 
c  c  (1+3) 

where  representative  values  of  A,  B,  and  C  are  presented  in 
Table  III. 

4 .  Setting  the  Reorder  Level 

As  in  the  previous  case  the  smoothing  estimates  of 
the  mean,  nt  =  afc  +  bfct  and  a  as  defined  in  Eq.  (3-8)  may  be 
combined  in  the  form 


Rt  =  yt  +  Kat 


(3-15) 


where  K  is  chosen  to  obtain  the  desired  risk.  In  this  model 
A  is  obtained  from  Eq.  (3-7)  by  smoothing  the  successive 


I 

An  estimate  of  b  can  then  be  obtained  by  smoothing  the  suc¬ 
cessive  observations  of  obtaining 

t>t  =  St(B)  =  a(Yt//xt)  +  |3St-l(B)  * 

Since  E(Y^)  =  bx^  ,  E(Y^/x^)  =  b.  It  follows  that 

t-1  i  4. 

E[S  'B)J=  a  l  0  E(Y  ./x.  .)  =  (1  -  3r)b 

i=0 

and  lim  E(b  )  =  b  which  implies  that  this  smoothed  estimator 
t-*°°  z 

for  b  is  asymptotically  unbiased. 

The  estimate  of  o  is  obtained  as  in  the  previous  two 
models  using  MAD.  In  Eqs .  (3-7)  and  (3-8)  the  forecast  error 

for  this  procedure  may  be  defined  as 
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IV.  COMPARISON  OF  MLE  AND  SMOOTHING 

In  this  section  the  procedures  developed  in  the  previous 
two  sections  are  compared  chiefly  by  means  of  computer  simu¬ 
lation.  Before  detailing  the  analysis  some  general  comments 
on  the  technique  of  simulation  are  In  order. 

A.  COMPUTER  SIMULATION 

1 .  General  Comments 

Computer  simulation  is  primarily  utilized  to  study 
systems  or  procedures  which  cannot  be  adequately  analyzed  by 
means  of  mathematical  analysis  or  hand  calculations.  Some 
advantages  of  simulation  are  the  rapidity  of  calculations, 
magnitude  of  data  which  can  be  generated  and  analyzed,  and 
the  ease  with  which  sensitivity  analysis  may  be  conducted. 

One  of  the  main  disadvantages  is  that  the  results  obtained  by 
simulation  are  known  to  be  typical  of  that  particular  case 
only  and  in  many  cases  may  not  be  accurately  extended  to  oth¬ 
er  sets  of  circumstances  with  any  assurance  that  the  results 
are  true  in  general.  Also,  the  results  of  a  simulation  study 
may  not  be  at  all  applicable  to  an  actual  situation  if  the 
model  developed  for  the  study  does  not  accurately  represent 
the  system  being  analyzed. 

2 .  Use  of  Random  Numbers 

Random  numbers  generated  by  a  computer  routine  are 
actually  pseudo-random  numbers  in  that  the  same  sequence  may 
be  reproduced  at  v;ill  by  means  of  proper  initialization  of 
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the  sequence.  In  the  strict  sense  of  the  word,  such  replica¬ 
tions  are  not  truly  random  and,  therefore,  detract  from 
modeling  realistic  situations  such  as  demand  which  is  known 
to  occur  at  random.  However,  it  is  this  property  of  being 
reproducible  that  makes  pseudo-random  numbers  useful  in  a 
study  which  has  as  its  main  purpose  the  comparison  of  various 
techniques  such  as  methods  of  demand  forecasting,  or  the  ef¬ 
fects  of  variations  in  the  parameters  of  the  model  being  stud¬ 
ied.  To  decide  if  one  demand  forecasting  technique  is 
superior  in  accuracy  to  another,  the  two  techniques  should  be 
compared  under  similar  circumstances,  using  the  identical  de¬ 
mand  data.  This  requirement  also  is  important  when  the 
effects  on  forecasting  accuracy  of  changes  in  parameters,  say 
the  mean  oc  variance  of  normally  distributed  demand,  are  to 
be  measured  or  tested  in  some  way.  If  differences  in  results 
are  to  be  interpreted  as  indicating  an  actual  difference  in 
the  methods  or  parameter  effects,  then  the  experimenter  must 
insure  that  such  simulation  variations  are  not  due  simply  to 
the  randomness  of  the  data  generated. 

3 .  Simulation  Used  in  this  Study  ' 

As  a  basis  for  comparison  of  maximum  likelihood  pre¬ 
diction  and  exponential  smoothing  as  described  in  Sections 
II  and  III,  random  demand  data  were  generated,  computations 
made  according  to  the  method  used,  and  the  results  compared. 
The  various  computer  routines  utilized  were  written  in  For¬ 
tran  IV  and  the  simulation  was  conducted  using  an  IBM  360/67 
computer.  The  method  of  generating  the  actual  data  for  each 
specific  model  is  discussed  briefly  along  with  the  results  in 


*»y»  V*  — 


30 


the  following  sections.  It  cannot  be  emphasized  enough  that 
all  of  the  numerical  results  obtained  were  simulation  results 
and  should  be  interpreted  as  such.  There  is  no  claim  made 
that  the  results  of  this  thesis  are  true  in  general  but  only 
that  they  may  give  some  idea  as  to  properties  of  the  methods 
tested.  The  results  presented  are  but  a  small  fraction  of 
the  total  cases  examined  and  situations  studied.  Some  of  the 
results  of  the  study  have  been  summarized  or  eliminated  en¬ 
tirely  for  the  sake  of  brevity,  but  all  of  the  examples  pre¬ 
sented  are  fairly  typical  of  the  general  results  obtained. 

B.  PRELIMINARY  CONSIDERATIONS 

Before  presenting  the  simulation  results  a  few  comments 
must  be  made  concerning  what  criteria  should  be  used  in  judg¬ 
ing  whether  or  not  a  forecasting  technique  is  satisfactory 
for  a  particular  purpose.  Brown  [Ref.  2]  suggests  the  fol¬ 
lowing  three  criteria  as  a  starting  point:  accuracy?  simplic¬ 
ity  of  computation?  flexibility  to  adjust  the  rate  of  response. 
Accuracy  of  the  forecasting  method  is  discussed  in  the  follow¬ 
ing  section.  For  the  models  examined  in  this  section,  flexi¬ 
bility  of  the  forecasting  technique  is  not  important  in  that 
it  is  assumed  that  the  model  is  good  and  that  the  underlying 
parameters  remain  constant  over  time.  Simplicity  of  computa¬ 
tion  is  discussed  in  the  next  paragraph. 

One  of  the  main  advantages  claimed  by  Brown  for  exponen¬ 
tial  smoothing  is  the  ease  and  simplicity  of  calculation,  to 
include  consideration  of  the  actual  computation  process  and 
the  computer  storage  required  to  store  the  information 
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necessary  to  make  running  computations  over  time.  By 

formulas  (3-2),  (3-3),  and  (3-7)  it  is  easily  seen  that 

smoothing  requires  the  retention  of  only  the  current  value  of 

S(Y)  and  A  for  the  constant  mean  case  and,  in  addition,  the 

2 

current  value  of  S  (Y)  for  the  linear  model.  At  each  obser¬ 
vation  only  multiplication  and  addition  are  required  to  up¬ 
date  the  estimate.  Smoothing  advocates  claim  [Ref.  1]  that 
complete  historical  records  must  be  maintained  in  order  to 
estimate  the  parameters  in  the  model  by  means  of  maximum  like¬ 
lihood  or  least  squares  regression  in  the  linear  case.  -How¬ 
ever,  it  is  well  known  from  decision  theory  that  to  estimate 
the  parameters  in  the  constant  mean  case  when  demand  is 

2  n  n  2 

N  ( y ,  o'  )  ,  the  statistics  )  Y.  and  )’  Y7  are  sufficient  [Ref. 

i=l  1  i-1  1 

4] .  This  means  that  these  two  statistics  are  all  that  must  be 
stored  from  one  period  to  the  next  in  order  to  update  the 
estimates  with  each  new  observation.  Granted,  these  statis¬ 
tics  must  be  combined  at  each  step  with  an  additional  opera¬ 
tion  or  two  required  including  the  taking  of  a  square  root  in 
the  estimation  of  o,  but  with  the  use  of  modern  high  speed 
computers  in  most  large  scale  inventory  control  operations  the 

extra  time  required  for  these  calculations  is  probably  negli- 

n 

gible .  For  the  linear  model  the  statistic  ]>  X^Y.  wliere 

i=l 

is  the  number  of  the  period  in  which  demand  Y^  is  observed 
must  also  be  maintained.  The  identities  which  express  the 
formulas  in  Section  II  in  terms  of  the  above  sufficient  stat¬ 
istics  are  developed  in  Appendix  A. 
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C.  COMPARISON  BY  SIMULATION 

For  simulation  purposes  random  demand  was  generated  in 
accordance  with  the  model  being  studied.  Uniform  random 
deviates,  U(0,1),  were  first  generated  and,  recalling  the 
asymptotic  normality  of  the  sum  of  uniform  random  variables, 
normal  demand  was  in  turn  generated.  The  random  demand  was 
tested  by  several  methods  to  assure  that  it  was  indeed  normal 
with  the  desired  parameters.  The  Chi-Square  test  (d.f.  =  9) 
was  used  and  the  resulting  test  statistic  value  of  12.8  fell 
below  the  critical  value  of  21.7  at  the  1%  level  of  signifi¬ 
cance.  The  normal  demand  was  either  used  directly  in  the 
constant  mean  case  or  converted  for  the  linear  model  by  means 
of  an  appropriate  linear  transformation. 

Most  of  the  results  of  the  simulation  are  presented  in  the 
form  of  tables  depicting  the  methods  of  estimation  and  the 
resulting  sample  quantities  of  interest.  Various  streams  of 
random  demand  were  used  in  many  instances  from  table  to  table, 
but  within  each  table  the  same  data  were  used  for  each  case 
in  order  to  make  suitable  comparisons. 

1 .  Simplified  MLE  and  Smoothing  in  the  Constant  Mean 
Model . 

Since  Zehna  [Ref.  11]  and  Ornek  [Ref.  6]  previously 
reported  on  several  aspects  of  the  case  where  demand  is 
normally  distributed  with  constant  mean  and  demonstrated  what 
appeared  to  be  a  significant  difference  in  the  accuracy  of 
estimates  of  the  mean  obtained  by  maximum  likelihood  and  ex¬ 
ponential  smoothing,  it  was  decided  to  take  that  model  as  a 
point  of  departure  for  the  simulation  study  in  this  thesis. 
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In  this  section  various  results  of  forecasting  demand  using 
the  simplified  maximum  likelihood  approach  Eq.  (2-12)  and 
smoothing  Eq .  (3-9)  are  presented. 

As  mentioned  in  the  introduction ,  the  approach  taken 
was  to  generate  random  demand,  make  estimates  of  the  various 
parameters  after  particular  sampling  intervals,  replicate  the 
experiment  a  number  of  times,  and  then  determine  the  average 
values  of  the  estimates  in  the  particular  periods  and  the 
variability  of  the  estimators  over  the  replicated  experiments. 
Table  IV  presents  these  results  for  estimates  of  reorder  level 
after  15,  30,  45,  and  300  periods  cf  demand  observation  over 
100  replications.  The  parameter  pairs  (y,a)  are  of  the  same 
proportions  as  those  studied  by  Zefna  and  previously  indicated 
to  be  realistic  by  NuvSup.  The  theoretical  reorder  level  was 
determined  from  Eq .  (2-11)  with  the  risk  level  constant  at 
0.05  for  each  parameter  pair.  In  this  case,  and  throughout 
the  study  unless  otherwise  indicated,  the  initial  values  for 
smoothing  were  chosen  in  order  to  give  smoothing  the  best  pos¬ 
sible  starting  conditions,  that  is, 

S0(Y)  =  y ,  and  AQ(Y)  = 

A  smoothing  constant  value  of  a  =  0.20  was  used,  the  value 
presently  being  used  by  NavSup  in  their  forecasting  models. 

The  entries  in  Table  IV  are  the  average  estimate  of 
reorder  level  and  the  sample  standard  deviation  of  the  esti¬ 
mators  over  100  replications  for  the  indicated  parameter 
pairs  and  number  of  periods.  The  results  are  rather  striking 
and  consistent.  In  most  cases  exponential  smoothing  tends  to 
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overestimate  the  reorder  level  while  MLE  underestimates,  a 
result  noted  also  by  Zehna.  However,  the  most  noteworthy 
aspect  is  the  much  larger  variability  of  the  smoothing  esti¬ 
mate  demonstrated  by  a  relatively  h:.gher  sample  standard 
deviation.  As  more  and  more  observations  are  taken  the  sample 
standard  deviation  of  the  maximum  likelihood  estimate  decreases, 
a  fact  that  is  easily  verified  in  the  theory.  Hov/evcr,  this 
property  is  not  demonstrated  by  the  smoothing  estimates . 

This  point  was  suggested  by  Ornek  [Ref.  6]  and  indicated  by 
Eq.  (3-12).  In  fact,  after  300  periods  the  sample  standard 
deviation  of  smoothing  is  consistently  over  six  times  as 
large  as  that  of  maximum  likelihood. 

It  is  indicated  in  Table  IV  that  for  these  particular 
parameter  pairs  the  results  do  not  depend,  on  the  average, 
upon  the  various  values  of  the  parameters.  For  example,  the 
average  reorder  level  estimate  given  by  MLE  after  15  periods 
for  the  pair  (10,1)  is  exactly  1/10 th  the  average  estimate 
for  the  pair  (100,10).  This  result  occurred  in  every  case 
examined.  This  is  not  surprising  since  the  random  demand 
for  each  case  was  generated  by  the  identical  stream  of  random 
numbers,  altered  only  by  the  specific  parameters.  For  this 
reason  it  was  decided  to  consider  further  only  the  parameter 
pair  (100,10)  as  a  typical  case.  Continuing  with  this  ap¬ 
proach,  Table  V  presents  the  results  for  the  (100,10)  para¬ 
meter  pair  with  estimates  being  made  after  5  to  100  periods . 

The  entries  are  the  average  reorder  level  and  sample  standard 
deviation  of  the  estimates  over  100  replications.  These  re¬ 
sults  are  consistent  with  the  ideas  brought  out  by  Table  IV, 
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Actual  Reorder  Level 


TABLE  V — Continued 


Actual  Reorder  Level 


but  some  additional  insight  into  the  source  of  reorder  level 
estimate  variability  may  be  gained  by  examining  Table  V. 

After  100  periods  the  sample  standard  deviation  of  R  for  risk 
level  0.01  is  over  twice  as  large  as  the  same  statistic  for  a 
risk  of  0,!>0.  This  is  because  in  the  estimate  at  the  former 
risk  level,  considerable  weight  is  given  to  the  estimate  of 
a  while  at  the  latter  risk  level,  K  =  0  and  the  reorder  level 
estimate  is  determined  solely  by  |.i .  Thus,  the  weight  given 
to  a  in  the  estimate  decreases  as  the  risk  increases  from 
0.01  to  0..'30  and  the  corresponding  variability  of  R  decreases. 

A 

The  same  general  effect  is  true  for  R  but  to  a  lesser  exten'-. 
This  is  to  be  expected  since,  from  the  theoretical  development, 

•V  *** 

a  is  roughly  twice  as  variable  as  y  in  the  limiting  case  (2.3 
times  for  a  =  0.2) .  For  MLE  the  variance  of  a  is  half  that 
of  |J  . 

To  determine  the  effects  of  staring  conditions,  or 
initial  values,  on  the  ultimate  estimate  given  by  smoothing 
the  same  experiment  was  conducted  comparing  the  perfect  initial 
values  given  in  Eq.  (4-1)  with  Sq(Y)  =  0  and  Aq(Y)  =  o.  The 
results  are  listed  in  Table  VI  for  various  values  of  the 
smoothing  constant  a  to  determine  how  long  the  initial  values 
effect  the  reorder  level  estimate  and  the  variance  of  the 
estimate.  In  the  Table,  Rl  implies  the  zero  initial  values 
and  R2  indicates  the  best  possible  starting  conditions.  The 
approximate  period  in  which  the  effect  of  the  initial  values 
become  negligible  is  indicated  as  the  “catch-up  period." 

One  of  the  most  interesting  results  of  this  Table  is  that  the 
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TABLE  VI 

EFFECT  OF  INITIAL  CONDITIONS 
FOR  SMOOTHING  WITH  VARIOUS  SMOOTHING  CONSTANTS 
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choice  of  smoothing  constant  a  has  Little  effect  on  the  av¬ 
erage  value  of  the  reorder  level  estimate  but  the  standard 
deviation  of  the  estimate  increases  with  increasing  a.  This 
is  not  surprising  since  for  a  stable  system,  as  in  the  cur¬ 
rent  case,  the  smaller  values  of  the  smoothing  constant  are 
clearly  indicated.  This  choice  of  x  causes  the  resulting 
estimator  to  be  less  sensitive  to  changes  in  the  system,  but 
have  much  smaller  variance  than  for  larger  values  of  the 
smoothing  constant. 

One  possible  measure  of  the  accuracy  of  an  estimator, 
at  least  for  comparison  purposes,  is  the  mean  squared  error 
of  the  estimate.  Mean  squared  error  (MSB)  may  be  defined  as 
the  sum  of  the  square  of  the  bias  and  the  variance.  The 
theoretical  MSE  and  average  sample  MSE  for  the  (100,10)  para¬ 
meter  pair  after  200  observations  of  demand  are  displayed  in 
Table  VII  for  the  MLE  and  smoothing  estimates  of  reorder 
level.  The  sample  results  compare  favorably  with  the  theo¬ 
retical  values  which  for  smoothing  are  approximations  based 
upon  the  results  of  Section  III.  Not  surprisingly,  the  re¬ 
sults  favor  the  MLE  procedure,  due  in  large  part  to  the  much 
smaller  variance.  In  fact,  the  variance  in  both  cases  is 
the  dominating  factor  as  the  bias  is  negligible  after  200 
periods . 

As  one  further  illustration  of  the  greater  sample 
variance  of  R,  reorder  level  for  the  (100,10)  case  was  esti¬ 
mated  using  1000  periods  of  demand  observations  and  the  ex¬ 
periment  repeated  100  times.  A  graph  of  the  MLE  and  smoothing 
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TABLE  VII 

THEORETICAL  AND  SAMPLE  MEAN  SQUARED 
ERROR  IN  PERIOD  200 


Theoretical  Mean  Squared  Error 


Risk 

MLE 

Smoothing 

with  Indicated  a 

0.10 

0.15 

0 . 20 

0 . 25 

0.30 

0.35 

0.40 

0.01 

1.87 

21.79 

33.74 

46.50 

60.10 

74.60 

90.05 

106.56 

0.05 

1.13 

13.50 

20.89 

28.75 

37.12 

46.04 

55.53 

65.65 

0.10 

0.91 

10.27 

15.87 

21.82 

2  8.16 

34.89 

42.05 

49.69 

0.62 

6.65 

10.26 

14.08 

18.13 

22.43 

26.99 

31.35 

0.50 

0.50 

5.26 

8.11 

11.11 

14.29 

17.65 

21.21 

25.00 

Average  Sample  Mean  Squared  Error  over  100  Replications 

Smoothing 

v.ith  Indicated  a 

Risk 

MLE 

0.10 

0.15 

0.20 

0.25 

0.30 

0.35 

0.40 

0.01 

1.64 

22.94 

35.64 

50.55 

67.26 

84.72 

102.28 

119.63 

0.05 

1.01 

14.06 

22.29 

32.01 

42.99 

54.56 

66.24 

77. 84 

0.10 

0.78 

10.57 

17.01 

24.64 

33.25 

42.37 

51.60 

60.82 

0.25 

0.53 

6.62 

10 .9  8 

16.10 

21.82 

27.87 

34.04 

40.24 

0.50 

0.47 

4.99 

8.37 

12.16 

16.21 

20.40 

24.66 

28.99 

estimates  of  reorder  level  over  the  100  replications  is  dis¬ 
played  in  Fig.  1.  The  theoretical  reorder  level  for  a  risk 
of  0.05  is  116.45.  Although  the  average  values  of  R  and  R 
are  not  significantly  different,  the  difference  in  variabil¬ 
ity  is  quite  obvious.  In  fact,  even  after  the  1000  periods 
the  sample  standard  deviation  of  R  is  5.654,  while  that  of 
R  is  but  1.007.  In  view  of  many  other  experiments  of  this 
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type  that  were  conducted,  it  appears;  that  for  a  risk  level  of 
0,05  and  a  =  0.2,  a  lower  bound  on  the  standard  deviation  of 
R  is  approximately  a/2.  No  results  to  that  effect  have  been 
determined  analytically  so  the  remark  is  included  here  only 
as  a  matter  of  interest. 

As  a  final  example  of  results  under  this  particular 
case,  a  comparison  of  the  sample  risks  obtained  by  both  meth¬ 
ods  of  estimation  was  made.  Again  with  y  =  100  and  a  =  10 
demand  was  generated  and  the  reorder  level  estimated  after 
various  sampling  intervals.  This  time  the  experiment  was 
repeated  500  times  and  the  risks  over  the  500  replications 
in  the  indicated  periods  are  presented  in  Table  VIII.  In 
several  of  the  cases  the  results  do  not  differ  significantly 
in  one  way  or  the  other,  but  generally  both  methods  tend  to 
yield  higher  risk  than  desired.  In  the  case  of  smoothing 
this  is  due  to  the  variability  of  the  estimate,  and  in  the 
case  of  ML3  the  higher  risk  may  be  attributed  to  the  tendency 
to  underestimate  reorder  level.  The  risk  obtained  by  smooth¬ 
ing  was  greater  than  that  obtained  by  MLE  in  the  majority  of 
instances . 

2 .  Linear  Model 

In  this  section  a  study  similar  to  that  of  the  previ¬ 
ous  section  is  presented  for  the  linear  model.  The  exact 
results  of  the  maximum  likelihood  development  of  Section  II 
are  used  for  comparison  with  the  exponentially  smoothed  esti¬ 
mates.  The  constant  mean  and  zero  Y-intercept  cases  are  pre¬ 
sented  as  special  cases  of  the  general  linear  model. 
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TABLE  VIII 


SAMPLE  RISK  OBTAINED  BY  SMOOTHING 
AND  SIMPLIFIED  MLE  PROCEDURE 


Desired 

Es  ti- 

Number  of 

Periods 

Risk 

mator 

10 

20 

50 

100 

150 

200 

0.01 

A 

R 

.044 

.016 

.018 

.014 

.006 

.010 

R 

.026 

.024 

.036 

.0  30 

.014 

.032 

0.05 

A. 

R 

.070 

.076 

.06  2 

.064 

.046 

.060 

R 

.088 

.078 

..0  70 

.096 

.064 

.076 

0.10 

A 

R 

.134 

.132 

.120 

.134 

.086 

.108 

R 

.120 

.120 

.114 

.142 

.110 

.128 

0.25 

A 

R 

.272 

.270 

.248 

.246 

.224 

.250 

R 

.264 

.284 

.258 

.272 

.256 

.250 

0.5C 

A 

R 

.514 

.480 

.504 

.494 

.500 

.482 

R 

.532 

.480 

.496 

.474 

.488 

.472 

The  first  simulation  experiment  performed  in  this 
section  was  to  generate  random  demand  according  to  the  linear 
model  given  by 

2 

y  =  a  +  bx  +  e  ,  where  e  'v  N(0,a  ) 

for  various  values  of  a,  b,  and  a.  The  maximum  likelihood 
and  smoothing  estimates  of  the  parameters  in  the  model  were 
calculated  after  100  periods  of  demand  observation  and  the 
results  averaged  over  100  replications.  The  estimate  .long 
with  the  sample  standard  deviation  of  the  estimates  are  dis¬ 
played  in  Table  IX.  As  indicated  in  Section  III  the  smooth¬ 
ing  method  does  not  estimate  the  value  of  the  intercept 
separately  but  rather  the  point  on  the  theoretical  line  at 
each  period.  For  purposes  of  the  Table  only,  afc  was 


1 

^tXt  ^rom  average  smoothed  estimate  of  the  point  on  the 

line  for  each  period  of  interest,  and  thus  no  sample  standard 

deviation  of  a,  was  obtained.  The  results  of  Table  IX  are 
t 

TABLE  IX 

ESTIMATE  OF  PARAMETERS 
IN  A  LINEAR  MODEL  AFTER  200  PERIODS 


Parameters 

MLE 

Smoothing  j 

(a,b, a) 

Sample 

Average 

Sample 

Standard 

Deviation 

Sample 

Average 

Sample 

Standard 

Deviation 

(5, .1,1) 

a 

4.84 

.095 

5.23 

* 

b 

.10 

.025 

.10 

.050 

a 

.93 

.077 

1.04 

.29  4 

(50, .2,3) 

a 

50.57 

.  37  8 

50.64 

* 

b 

.20 

.099 

.19 

.151 

0 

2.85 

.200 

3.13 

.882 

(100, .3,4) 

a 

100.24 

.690 

100.87 

* 

b 

.30 

.117 

.  30 

.201 

0 

3.96 

.  347 

4.17 

1.176 

(500, .5,5) 

a 

499.68 

.540 

501.05 

* 

b 

.50 

.249 

.50 

.252 

a 

4.96 

.377 

5.21 

1.470 

*  See  Narrative 


not  very  revealing  except  that  once  again  the  smoothed  esti¬ 
mates  are  more  variable  than  the  corresponding  maximum  like¬ 
lihood  estimates  of  the  parameters.  More  interesting  was 
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the  facti  that  for  many  experiments  using  various  values  of 
a,  bf  and  a  the  sample  risks  obtained  by  both  methods  were 
seen  to  be  independent  of  the  particular  parameters  in  the 
model,  and  appear  to  depend  only  upon  the  number  of  observa¬ 
tions  upon  which  the  estimates  are  based.  For  this  reason 
it  was  decided  to  again  direct  attention  to  only  a  couple  of 
parameter  values. 

For  the  sake  of  brevity  only  a  few  of  the  experiments 
conducted  arc  reported  here,  in  that  the  results  are  much  the 
same  as  in  the  previous  sections.  Not  much  insight  could  be 
gained  from  attempting  to  analyze  the  sample  estimates  of  the 
parameters.  It  appeared  more  beneficial  to  examine  the  sample 
risks  obtained  by  the  two  methods  cf  estimation.  Table  X 
presents  the  sample  risks  obtained  after  the  indiccited  num¬ 
ber  of  periods  over  1000  replications.  It  should  be  pointed 
out  that  the  entries  in  the  Table  are  not  the  fraction  of 
time  the  demand  exceeded  the  reorder  level  estimates  through 
50  periods  for  example,  as  was  the  case  in  the  results  pre¬ 
sented  by  Zehna  and  Ornek,  but  rather  the  percentage  of  times 
the  actual  demand  exceeded  the  reorder  level  in  the  (p+l)st 
period  over  the  1000  samples  of  the  estimates.  The  reasons 
for  proceeding  in  this  manner  were  discussed  in  the  introduc¬ 
tion.  For  this  case  the  parameters  (a,b,o)  are  (50,2,5). 
Again  the  best  possible  initial  values  were  used  for  smooth¬ 
ing,  namely 

SQ(Y)  =  a  -  (a/3) b  ,  and  Sg(Y)  =  a  -  2(a/3)b  . 
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TABLE  X 


LINEAR  MODEL  -  ACTUAL  RISK  OBTAINED 
IN  1000  REPLICATIONS  (a=50,  b=2,  0=5) 


► 

Number  of 

Periods 

10 

20 

50 

100 

A 

A 

- 

/■S. 

/\ 

R 

R 

R 

R 

R 

R 

R 

R 

0.01 

.007 

.036 

.014 

.027 

.011 

.024 

.008 

.029 

0.05 

.045 

.103 

Bfii 

MM 

.049 

.078 

.053 

.068 

0.10 

.092 

.  155 

BSI 

mm 

.103 

.137 

.110 

.114 

0.25 

.233 

.  306 

.256 

.264 

.249 

.274 

.256 

.249 

0.50 

.482 

.516 

.521 

.519 

.510 

.5  36 

.505 

.517 

The  results  here  are  noteworthy  in  that  the  sample  risk  ob¬ 
tained  by  the  smoothed  estimates  exceeded  that  of  MLE  in 
almost  every  case,  in  the  sense  that  the  smoothing  risk  dif¬ 
fered  more  in  one  way  or  the  other  from  the  desired  risk  than 
did  the  risk  obtained  by  MLE.  It  is  felt  that  a  sample  size 
of  1000  for  each  period  makes  the  results  fairly  significant, 
but  once  again  it  must  be  recognize  1  that  the  results  are 
sample  values  from  simulation  and  should  be  judged  accordingly. 

Since  the  simplified  maximum  likelihood  procedure 
yielded  good  results  in  the  constant  mean  case,  it  was  de¬ 
cided  to  try  a  similar  version  for  the  linear  case.  To  ap¬ 
ply  the  methods  of  Section  II  exactly,  and  thereby  be  able 
to  make  exact  probability  statements  about  risk,  the  value  of 
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the  t-distribution  percentile  must  be  changed  with  each  addi¬ 
tional  period  of  demand  observation.  As  n,  the  total  number 
of  periods,  becomes  large,  however,  the  value  of  the  t- 
distribution  percentiles  approach  those  of  a  standard  normal 
distribution,  indicated  by  K  throughout  this  thesis.  In  view 
of  this,  it  was  decided  to  determine  what  effect  replacing 
the  varying  values  of  t  with  the  corresponding  limiting  val¬ 
ue  of  K,  always  smaller,  would  have  on  the  sample  risk  ob¬ 
tained  by  MLE.  The  resulting  estimate  is  referred  to  as  the 
K-version  in  this  report.  In  Table  XI  this  has  been  done  for 
the  general  linear  case  with  parameter  triple  (100,2,10). 

The  Table  entries  are  the  average  reorder  level  estimate, 
sample  standard  deviation  of  the  estimates  over  200  replica¬ 
tions,  and  the  sample  risk  obtained  by  the  indicated  method 
of  estimation.  The  desired  risk  levels  of  0.25  and  0.50 
have  been  omitted  primarily  for  the  sake  of  brevity  once 
again,  but  also  because  the  values  are  probably  too  large  for 
any  practical  application.  Tables  XII  and  XIII  present  the 
same  information  for  the  special  cases  (100,0,10)  and 
(0,2,10)  respectively. 

The  results  of  these  final  three  tables  for  this 
section  are  consistent  with  the  earlier  results.  The  sample 
standard  deviation  of  the  smoothed  estimates  is  relatively 
constant  over  the  number  of  periods  and  the  corresponding 
values  for  both  versions  of  MLE,  although  consistently  larger 
in  the  early  periods,  decrease  over  time  and  are  roughly  a 
third  as  large  as  the  deviation  of  the  corresponding  smoothed 
estimates  after  100  periods.  The  maximum  likelihood  risk 


TABLE  XI 


REORDER  LEVEL,  SAMPLE  STANDARD  DEVIATION  OF  ESTIMATOR, 
AND  SAMPLE  RISK  OVER  200  REPLICATIONS.  LINEAR  (100,2,10) 


S’. >  "  ^ 

Es  ti- 

Number  of  Periods 

1 

mator 

10 

50 

100 

0.01 

R  -  T 

156.63,10.62 

.02 

227.41,3.77 

.01 

326 .11,2.62 
.01 

A 

R  -  K 

150.67,9.59 

.02 

226.59,3.72 

.01 

325. 71,2.60 
.01 

»v 

R 

145.83, 8.54 
.03 

228.11,8.58 

.01 

327.60,8.65 

.02 

0.05 

A 

R  -  T 

144.64, 8.67 
.04 

219.74,3.34 

.05 

318.87,2.31 

.04 

A 

R  -  1< 

142.36,8.36 

.05 

219.42,3.3? 

.06 

318.70,2.30 

.05 

R 

138.92, 7.26 
.08 

220.51,7.24 

.07 

319 .98,7.16 
.06 

0.10 

■ 

139 .19, 7.97 
.08 

215.81,3.16 

.08 

315 .07,2.18 
.13 

R  -  K 

137.96,7.84 

.09 

215.63,3.15 

.09 

314.99,2.18 

.13 

R 

135.27,6.67 

.13 

216.48,6.66 

..11 

315.94,6.49 

.12 

50 


TABLE  XII 


REORDER  LEVEL,  SAMPLE  STANDARD  DEVIATION  OF  ESTIMATOR, 
AND  SAMPLE  RISK  OVER  200  REPLICATIONS.  LINEAR  (100,0,10) 


Desired  Esti- 
Risk  ms  tor 


0.01  R  -  T 

A 

R  -  K 


R  -  T 

A 

R  -  K 

R 

R  -  T 

A 

R  -  K 

R 


Number  of  Period 

s 

10 

50 

100 

129 .78,7.29 

124 . 48,2 . 76 

123.93,1.90 

.01 

.01 

.01 

124.61,6.28 

123.69,2 .69 

123.54,1.87 

.01 

.01 

.01 

121.85,6.71 

124.16,6.98 

12  .73,6.86 

.01 

.03 

.02 

119.37,5.32 

117.04,2.14 

116.78,1.48 

.04 

.05 

.04 

117. 39,4 .97 

116.74,2. 11 

116.64,1.47 

.05 

.05 

.04 

115 .48,5. 36 

117.15,5.44 

116.71,5.36 

.10 

.08 

.0  7 

114.62,4.51 

113.23,1.85 

113.06,1.29 

.09 

.11 

.09 

113.56,4.35 

113.06,1.84 

112.98,1.28 

.13 

.11 

.10 

112.10,4.71 

113.43,4.72 

112.99,4.65 

.13 

.13 

.11 
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TABLE  XIII 


REORDER  LEVEL,  SAMPLE  STANDARD  DEVIATION  OF  ESTIMATOR, 
AND  SAMPLE  RISK  OVER  200  REPLICATIONS.  LINEAR  (0,2,10) 


Desired 

Lsti- 

Number  of  periods 

Risk 

mat  or 

10 

50 

100 

0.01 

K  -  T 

54.62,9.25 

127.04,3.42 

226.08,2.39 

.01 

.01 

.01 

A 

R  -  K 

48.12,8.10 

125.96, 3.34 

225.52,2.36 

.02 

.02 

.01  ■ 

R 

47.62 ,8.80 

126.40,7.11 

225.80,6.93 

.01 

.03 

.01 

0.05 

R  -  T 

43.28,7.43 

119 .46,2.93 

218.88,2.06 

.04 

.05 

.04 

R  -  K 

40.79,6.98 

118.97,2.90 

218.61,2.05 

.06 

.05 

.04 

R 

40.74,7.32 

119.32,5.59 

218.76,5.44 

.06 

.08 

.07 

0.10 

R  -  T 

38.11,6.74 

115.58,2.72 

215.11,1.92 

.08 

.10 

.11 

R  -  K 

3G.90,6 .46 

115.27,2.70 

214.95,1.92 

.09 

.10 

.13 

R 

37.09,6.61 

115.57,4.88 

215.03,4.73 

.10 

.13 

.11 

using  the  exact  t  factors  is  never  larger  than  the  risk 
when  the  corresponding  values  of  K  are  used.  On  the  basis 
of  these  results,  it  would  be  difficult  to  state  that  the 
modified  K-version  of  maximum  likelihood  estimation  of  re¬ 
order  level  is  significantly  poorer  than  the  actual  estimator 
using  t.  In  fact,  in  the  earlier  periods  the  sample  standard 
deviation  of  the  K-version  is  smaller  than  that  of  the 
actual  MLE  estimation  technique.  Ihis  of  course  is  due  to 
the  smaller  weight  placed  upon  the  estimate  of  a  in  the 
earlier  periods. 


V.  BA  YDS  PROCEDURES 


At  times  the  manager  of  a  supply  system  may  have  "a 
priori"  degrees  of  belief  about  the  values  of  the  unknown 
parameters  in  the  demand  distribution  assumed  for  a  partic¬ 
ular  model.  In  other  words,  he  may  subjectively  feel,  prior 
to  actually  observing  any  demand  data,  that  certain  values  of 
the  unknown  parameters  in  the  mode]  are  more  likely  than 
others.  He  may  further  wish  to  include  his  subjective  feel¬ 
ings  along  with  observed  values  of  the  random  demand  to  make 
estimates  of  the  unknown  parameters;.  Bayesian  methods  may 
accomplish  the  above,  combining  the  real  data  and  subjective 
feelings  into  estimates  which  actually  minimize  the  decision 
maker's  risk  as  defined  in  a  very  special  way. 

The  subjective  probabilities  which  measure  the  degree  of 
belief  discussed  above  determine  a  prior  distribution  for  the 
parameters.  The  sample  values  of  the  observed  demand  are 
used  with  the  prior  distribution  to  determine  a  posterior 
distribution  which  is  then  used  in  order  to  obtain  an  esti¬ 
mate  of  the  unknown  parameters.  The  choice  of  a  prior  distri¬ 
bution  for  the  unknown  parameters  is  certainly  arbitrary  and 
has  lead  to  general  controversy  as  to  the  appropriateness  and 
value  of  Bayesian  methods.  It  is  not  much  more  arbitrary, 
however,  than  the  selection  of  initial  values  for  use  with 
exponential  smoothing  estimation  techniques.  These  initial 
values  were  shown  in  Section  IV  to  effect  the  estimate  for  as 
many  as  100  periods,  depending  upon  the  value  of  the  smoothing 
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constant f  another  aspect  of  smoothing  which  is  certainly  open 
to  subjectivity.  Without  attempting  to  discuss  the  appropri¬ 
ateness  of  Bayesian  methods  for  the  demand  forecasting  prob¬ 
lem  at  hand,  and  certainly  without  recommending  its  use 
without  further  examination,  the  concept  is  developed  in  this 
section  as  it  applies  to  the  model  of  normal  demand  with  con¬ 
stant  mean.  The  Bayes  estimation  results  are  then  compared 
to  exponential  smoothing  in  a  variety  of  situations. 


A.  BAYES  ESTIMATION 


One  of  the  many  limitations  of  Bayesian  procedures  is  the 
extreme  difficulty  of  obtaining  joint  prior  and  posterior 
distributions  for  estimating  more  than  a  single  parameter  in 
a  model.  However  unrealistic  it  may  appear,  it  will  be  as¬ 
sumed  here  that  demand  is  normally  distributed  with  mean  0 

2 

and  known  variance  o  .  This  might  even  be  plausible  in  cases 
where  the  demand  for  various  items  of  supply  demonstrates 
about  the  same  degree  of  variability  but  fluctuates  about 
different  and  unknown  mean  values.  In  any  case,  it  is  as¬ 


sumed  that 


f  (y  |  o) 


where  f(y|0)  indicates  that  the  distribution  of  Y  is  condi¬ 


tioned  on  the  value  of  the  parameter  6.  It  can  be  shown 


[Ref,  8]  that  if  the  prior  distribution  assumed  for  6  is 
2 

N(eo,co  )  >  then  the  posterior  distribution  is  also  normal 
with  mean 
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2  2 
OqY  +  a  o 


o 


(5-1) 


and  variance 


+  o 


2 


2  2 
a  ao 


(5-2) 


Thus  the  mean  of  the  posterior  distribution  is  a  linear  com¬ 
bination  of  the  observed  demand  and  the  mean  of  the  prior 
distribution  G Q  .  For  estimation  purposes  it  can  be  shown 
that  the  mean  of  the  posterior  distribution  as  an  estimate 
of  the  parameter  0  minimizes  the  loss  as  measured  by  squared 
error.  That  is,  if  6 1  is  an  estimate  of  0  and  the  conditional 

p 

and  prior  distributions  are  as  specified  above,  then  (0  -  O') 
is  minimized  by  choosing  6 '  equal  to  the  mean  of  the  posterior 
distribution . 

It  should  be  noted  that  in  Eq.  (5-1)  the  weight  given  to 

c2 

the  observation  in  the  linear  combination  is  ■-»-  ° so  that 

cr  + 

o 

if  the  decision  maker  is  fairly  certain  that  0Q  is  the  true 
value  of  the  parameter  0  or  if  he  wishes  to  bias  the  estimate 
in  favor  of  his  prior  feeling,  he  should  choose  a  small  vari¬ 
ance  for  the  prior.  If  the  exact  value  of  0  is  less  certain 

however,  the  variance  of  the  prior  should  be  relatively  large 
2 

compared  to  0  .  In  this  way,  most  of  the  weight  in  the  re¬ 

sulting  estimate  will  be  placed  on  the  observed  demand. 
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B.  BAYES  FORECAST  MODEL 

The  Bayes  estimate  of  mean  demand  developed  in  the  pre¬ 
ceding  section  may  be  used  to  forecast  demand  and  set  an  ap¬ 
propriate  reorder  level  to  obtain  ar  approximate  risk  in  a 

manner  analogous  to  the  procedures  used  in  Section  II-D.  The 

2 

demand  in  period  one  is  observed  and  assuming  a  N(Go,aQ)  prior 
distribution  on  0,  a  posterior  distribution  of  0  given  y^  may 
be  determined.  As  previously  derived,  the  posterior  is  normal 
with  mean 


0*  = 


V'l  +  a2°< 

afTT^ 


and  variance 


al  = 


ao  a 


2  ,  2 
a0  +  a 


The  estimate  of  reorder  level  then  is  given  by 

RJ  =  0|  +  Kc 


Since  information  is  gained  by  the  observation  y^  ,  the  poste¬ 
rior  distribution  determined  in  period  one  is  used  as  the  prior 
distribution  for  period  two.  After  observing  y2  ,  a  new  poste¬ 
rior  distribution  for  0  is  determined.  It  is  again  normal 
with  mean 


°2 


2 

aly2 

2 

°1 


+  o2e 


+  c 


* 

2 


and  variance 


2  2 
°1  + 


a 
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( 

Upon  substitution  for  0* 


and  a 


2 

1 


the  above  becomes 


6*  = 


°clyl+y2)4  g29 

2a2  4  o 2 
o 


2 

o  _  o 

anu  a  ^  ;  2 


20  4-  0 

o 


Continuing  in  this  manner  it  may  be  shown 
after  n  periods  the  estimate  and  variance 


2  * 

by  induction  that 
are 


It  follows  that 

na2  G  +  a 

E  (0* )  -  — - 

n  2  . 

n  Oo  +  o 

lim  E (0* )  =  0 
n*>»  n 

2 

and  lim  0=0 
n-*09  n 


2 


1 


0 

o 


The  reorder  level  set  for  the  (n+1) st  period  is  given  by 

R*  =  0*  +  K0 
n  n 

where  K  is  the  (l-ct)th  percentile  of  the  N(0,1)  distribution. 


C.  SELECTION  OF  PARAMETERS  FOR  THE  PRIOR- -SIMULATION  RESULTS 
The  results  of  Bayes  estimation  depend  strongly  in  most 

cases  upon  the  choice  of  the  parameters  in  the  prior  distribu- 

2  2  .  . 
tion.  The  ratio  of  0Q  to  0  determines  the  weight  given  to 

the  observed  demand  relative  to  the  weight  placed  upon  the 

"a  priori"  best  guess  of  mean  demand  0Q.  To  determine  exactly 

how  much  the  estimate  of  mean  demand  is  effected  by  various 

2 

combinations  of  0  and  0  .  a  limited  amount  of  simulation  was 

o  o 
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conducted.  For  the  constant  mean  case  with  0  =  100  and 

a  =  10,  random  demand  was  generated  for  100  periods.  The 

Bayes  estimate  of  0  was  computed  for  combinations  of  0q  =  0, 

0,  0/2,  0/3,  0/4,  0/5,  and  a2  =  a,  a?/2,  a2/3,  a2/4,  o2/5, 

2  2  2  2 

2a  ,  3a  ,  4a  ,  5a  .  The  results  of  this  investigation  are 
summarized  in  Table  XIV,  the  entries  being  the  Bayes  estimat.es 
of  0  after  5  and  100  periods  of  demand  observation.  Obviously 
the  closer  0q  is  to  the  true  value  of  0  the  better  the  result¬ 
ing  estimate  is.  It  should  be  noted  however,  that  for  this 
9  2 

ratio  of  a “/a  sufficiently  large,  the  Bayes  estimate  ap¬ 
proaches  the  actual  value  of  G  rapidly.  For  example,  in  the 

2  2 

above  case  with  0  =0  and  a  =  5>a*  the  Bayes  estimate  of 

o  o 

0  was  96.6  after  just  5  periods  and  99.5  at  the  end  of  50 
periods . 

From  the  detailed  data  obtained  it  appeared  that  the  Bayes 

estimate  was  fairly  stable  especially  for  a  relatively  good 

selection  of  Gq.  For  this  reason  it  was  decided  to  compare 

the  Bayes  estimates  with  the  corresponding  exponentially  smo- 
.  2 

othed  estimate  when  a  is  known.  The  results  of  this  compar¬ 
ison  are  displayed  in  Table  XV.  For  purposes  of  comparison 
the  initial  guess  of  mean  demand  was  set  at  zero  for  both 
estimation  techniques,  that  is,  Sq(Y)  =  0  and  0Q  =  0 ,  and  ran¬ 
dom  demand  generated  with  0  =  100  and  a  =  10 .  The  mean  demand 
was  estimated  using  each  method  with  various  choices  of  the 
observed  value  weighting  factors,  a  in  the  case  of  smoothing, 
and  a*  for  the  Bayesian  method.  The  entries  in  the  Table  re¬ 
veal  that  the  Bayesian  approach  yields  better  estimates  than 
smoothing  in  the  early  periods  even  for  a  relatively  large 
smoothing  constant.  A  choice  of  a  >.2 a  resulted  in  better 
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TABLE  XV 


AVERAGE  ESTIMATE  OF  MEA\T  DEMAND 
BY  BAYES  AND  SMOOTHING 


0  =  100 

6  =0 

a  =  10 

O 

S  (Y)  =  0 
o 

Number  of  Periods 

Estimator 

Parameter 

Values 

5 

10 

15 

50 

100 

2 

°o  “  0 

83.7 

91.0 

93.7 

9  7.9 

99.1 

2a2 

91.3 

95.4 

96.8 

98.8 

99.5  ' 

BAYES 

3a2 

94.1 

96.9 

9  7.8 

99.2 

99.7 

4a2 

95.6 

97.7 

98.3 

99.3 

99.8 

5a2 

96.6 

98.2 

9  8.7 

99.4 

99.8 

a  =  0 . 1 

41.1 

65.1 

79.3 

99.2 

100.4 

0.2 

67.  3 

89.1 

96.4 

99 . 8 

100.6 

SMOOTHING 

0.3 

83.2 

96.9 

99.5 

99.9 

100.8 

0.4 

92.1 

99.0 

100.0 

99.9 

100.9 

0.5 

96.5 

99.5 

100.3 

99.9 

100.9 

estimates  by  the  Bayes  technique  through  15  periods  than  the 
smoothing  estimator  with  the  usual  smoothing  constant  0.20. 

'  To  determine  the  effect  of  the  prior  parameters  on  the 
sample  risk  obtained  by  the  forecasting  techniques  the  same 
model  was  used  to  generate  demand  for  100  periods  and  the 
experiment  replicated  1000  times.  For  the  case  where  the 
prior  estimates  were  the  best  possible,  that  is,  SQ(Y)  =  0 
and  8q  =  6 ,  the  results  are  presented  in  Table  XVI.  For 
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various  choices  of  weighting  factors,  the  desired  risk  was 
held  constant  at  0.05.  The  results  are  quite  remarkable. 

TABLE  XVI 

SAMPLE  RISK  OBTAINED  Bi  BAYES 
AND  SMOOTHING  OVER  1000  REPLICATIONS 
(GOOD  INITIAL  VALUES) 


6  =  100 

6  =  100 
o 

a  =  10 

S  (Y)  =  100 
o 

Number 

of  Periods 

Estimator 

Parameter 

Values 

5 

10 

15 

50 

100 

2  n  2 

O  =0 

o 

;066 

.056 

.057 

.057 

.055 

BAYES 

2a2 

.069 

.057 

.058 

.057 

.055 

3a2 

.072 

.058 

.058 

.057 

.055 

4cr2 

.073 

.059 

.058 

.057 

.055 

5a2 

.074 

.059 

.058 

.057 

.055 

a  =  0.1 

.060 

.051 

.057 

.057 

.063 

0.2 

.059 

.061 

.061 

.061 

.069 

SMOOTHING 

0.3 

.066 

.0  73 

.069 

.0  70 

.079 

0.4 

.073 

.077 

.079 

.076 

.083 

0.5 

.082 

.0  82 

.083 

.084 

.091 

Except  for  a  few  cases  the  Bayes  forecasting  method  provides 
a  sample  risk  closer  to  the  desired  risk  than  the  smoothing 
process.  Also  noteworthy  is  the  fact  that  the  Bayes  method 
again  appears  much  more  stable  than  smoothing.  This  is  in¬ 
deed  the  case  as  indicated  by  the  sample  standard  deviation 


\ 

in  Table  XVII.  For  only  the  early  periods  and  for  small 


TABLE  XVII 

SAMPLE  STANDARD  DEVIATION  OF  BAYES 
AMD  SMOOTHING  ESTIMATORS  OF  MEAN  DEMAND 


e  =  100 

o  =  10 

O  =  100 
o 

S  (Y)  =  100 
o 

Number 

of  Periods 

Estimator 

Parameter 

Values 

5 

10 

15 

50 

100 

2  2 
o  =  a 
o 

3.95 

3.26 

2.84 

1.95 

1.69 

2  a2 

4.25 

3.40 

2.94 

1.97 

1.72 

BAYES 

3a2 

4.36 

3.41 

2 .94 

1.96 

1.73 

••  4a2 

4.46 

3.47 

2.95 

1.95 

1.72 

5c2 

4.48 

3.4  8 

2.97 

1.95 

1.72 

a  =  0 . 1 

2.31 

2.6  4 

2.68 

2.61 

2.73 

SMOOTHING 

0.2 

3.41 

3.6  8 

3.63 

3.60 

3.65 

0.3 

4.30 

4.51 

4.40 

4.49 

4.44 

0.4 

5.10 

5.27 

5.17 

5.30 

5.16 

0.5 

5.96 

6.03 

5.90 

6.06 

5.88 

values  of  the  smoothing 

constant  does 

the 

smoothed 

estimate 

demonstrate  a  smaller  variance  than  the  corresponding  Bayesian 
estimate.  As  more  and  more  observations  are  made,  the  sample 
standard  deviation  of  the  Bayes  estimate  becomes  smaller 
while  the  variance  of  the  smoothed  estimate  remains  about  the 
same. 
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( 

Thd  same  experiment  was  repeated  with  0  ~  0  and  S  (Y)  -  0 

o  o 

to  determine  how  quickly  the  two  forecasting  methods  could 
demonstrate  sensitivity  to  the  actual  data  and  obtain  a  sample 
risk  close  to  the  desired  risk  0.05.  In  this  case  also  the 
experiment  was  replicated  1000  times  in  order  to  obtain  a 
degree  of  significance  in  the  results.  The  sample  risks 
obtained  by  both  methods  are  displayed  in  Table  XVIII.  The 

TABLE  XVIII 

SAMPLE  RISK  OBTAINED  BY  BAYES 

AND  SMOOTHING  OVER  1000  REPLICATIONS 
(POOR  INITIAL  VALUES) 


e  =  ioo 

eo  =  o 

o 

r— 1 

II 

O 

Sq(Y)  =  0 

Number 

of  Periods 

Estimator 

Parameter 

Values 

10 

15 

50 

100 

2  2 
°o  =  0 

.635 

.294 

.187 

.079 

.071 

0  2 

2  o 

.  310 

.159 

.107 

.068 

.062 

BAYES 

i  2 

3a 

.211 

.128 

.090 

.066 

.061 

4a2 

.168 

.109 

.084 

.065 

.061 

5o2 

.146 

.097 

.080 

.065 

.060 

a 

n 

o 

• 

1.000 

.992 

.737 

.067 

.063 

' 

0.2 

.995 

.390 

.145 

.061 

.069 

SMOOTHING 

0.3 

.761 

.137 

.080 

.070 

.079 

0.4 

.365 

.088 

.080 

.076 

.083 

0.5 

.187 

.084 

.083 

.084 

.091 
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I 

sample  standard  deviations  of  the  estimators  did  not  change 

appreciably  from  the  previous  case  and  are  not  presented. 

The  results  are  as  expected.  When  a  small  value  is  chosen 

for  the  smoothing  constant  the  long  term  results  are  fairly 

accurate,  but  the  results  in  the  eacly  periods  are  far  from 

satisfactory.  If  a  larger  value  is  chosen  for  a  the  smoothed 

estimate  adjusts  much  more  rapidly  but  gives  poorer  results 

in  the  long  run  due  to  the  increased  variance.  The  Bayes 

2  2 

estimator  adjusts  quite  rapidly  for  =  5a  and  the  long 
term  results  for  all  values  tested  are  fairly  accurate. 
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VI.  CONCLUSIONS  AND  RECOMMENDATIONS 


\ 


Before  summarizing  the  conclusions  reached  in  this 
study  and  the  recommendations  which  follow,  it  may  be  worth¬ 
while  to  present  briefly  a  few  of  the  conclusions  reached  by 
Zehna  and  Ornek  in  similar  studies  [Refs.  10,  11,  and  6] . 

1.  It  is  difficult  to  judge  exponential  smoothing  on  a 
theoretical  basis  due  to  a  lack  of  knowledge  concerning  the 
probability  distributions  of  the  smoothing  estimators. 

2.  Many  of  the  analytical  results  obtained  by  Brown 
[Ref.  2]  are  asymptotically  valid  only  although  applied  to 
the  finite  demand  case. 

3.  The  source  of  much  of  the  difficulty  in  determining 
the  distributions  of  the  smoothing  estimators  is  in  the  use 
of  MAD  as  an  estimate  of  variability . 

4.  On  the  basis  of  simulation  results  the  simplified  MLE 
approach  is  to  be  preferred  in  those  cases  in  which  smoothing 
is  presently  employed  by  NavSup. 

A.  CONCLUSIONS 

It  has  been  pointed  out  numerous  times  throughout  this 
thesis  that  the  numerical  results  and  comparisons  made  are 
based  on  computer  simulation  sample  outcomes.  However,  the 
experimentation  was  conducted  with  great  care  taken  in  the 
handling  of  random  numbers  and  sample  size  to  assure  a  degree 
of  significance  in  the  results.  The  results  of  the  various 
comparisons  of  MI.E  and  exponential  smoothing  are  quite  con¬ 
sistent  and  reinforce  the  conclusions  reached  in  the  above 
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references.  Without  regressing  to  the  specific  numerical 
examples  presented  previously  in  this  thesis  it  can  be  con¬ 
cluded  that  on  the  basis  of  the  analysis  performed  in  this 
study  the  smoothing  estimators  are  more  variable  than  any 
form  of  maximum  likelihood  estimation  and  prediction  method 
tested  in  the  linear  or  constant  mean  models.  The  source 
of  most  of  the  variability  in  the  smoothed  estimates  appears 
to  be  in  the  MAD  estimate  of  the  variance  in  the  demand 
distribution.  This  results  in  far  less  consistent  risks  ob¬ 
tained  by  smoothing  than  the  comparable  risks  obtained  using 
MLE .  As  for  the  ease  or  difficulty  of  computation  using  the 
various  methods,  the  exact  MLE  computations  are  certainly 
more  involved  but  probably  not  to  the  extent  of  resulting  in 
significantly  increased  computation  time.  In  any  case  the 
trade-off  between  computational  accuracy  and  computation  time 
is  a  managerial  decision.  In  the  constant  mean  model  the 
simplified  MLE  technique  is  certainly  not  more  difficult  to 
handle  computationally  than  smoothing  and  has  been  shown  to 
produce  significantly  more  stable  and  accurate  results.  The 
Bayesian  approach,  while  not  immediately  applicable  without 
further  study,  appears  to  offer  a  procedure  which,  although 
subjective  in  nature,  has  a  sounder  theoretical  basis  than 
does  smoothing.  On  the  basis  of  simulation  the  Bayes  method 
has  been  shown  to  produce  good  estimates,  be  relatively  flex¬ 
ible  and  sensitive,  and  demonstrate  less  variability  than 
exponential  smoothing. 


B. 


R E40MME N DAT I OH S 


For  the  reasonsdiscussed  above  it  is  strongly  recommended 
that  NavSup  or  other  agencies  using  smoothing  for  demand  fore¬ 
casting  under  the  assumptions  of  the  models  discussed  in  this 
thesis  give  serious  consideration  to  at  least  testing  other 
forecasting  techniques  under  a  variety  of  circumstances  to 
compare  and  contrast  the  results.  This  testing  should  involve 
the  use  of  actual  historical  demand  data  so  the  results  ob¬ 
tained  by  the  alternate  forecasting  techniques  can  be  com¬ 
pared  in  retrospect  with  the  forecasting  results  which  were 
obtained  by  means  of  exponential  smoothing.  Because  of  its 
intuitive  appeal  and  computational  simplicity  exponential 
smoothing  should  be  investigated  further  with  prime  interest 
in  discovering  a  method  to  replace  MAD  as  an  estimate  of 
variance.  If  this  can  be  accomplished  it  may  turn  out  that 
smoothing  is  in  some  respects  an  optimal  method  of  forecast¬ 
ing.  It  is  also  recommended  that  further  study  be  made  of 
the  Bayesian  approach  developed  in  this  thesis  to  determine 
its  applicability  in  actual  situations.  It  would  be  reward¬ 
ing  to  develop  a  Bayesian  procedure  which  could  be  used  to 
jointly  estimate  the  mean  and  variance  of  the  demand  distri¬ 
bution  and  yet  be  relatively  simple  to  apply.  In  cases  such 
as  those  involving  a  constant  proportionality  between  the 
mean  and  variance  of  the  underlying  distribution  it  is  spec¬ 
ulated  that  an  ad  hoc  procedure  could  be  developed  that  would 
involve  an  "a  priori"  estimate  of  the  variance  and  then,  after 
data  are  observed,  the  use  of  the  posterior  distribution  to 
estimate  the  variance  as  a  function  of  the  posterior  mean. 


Finally  it  is  recommended  that  an  inves tiga tion  be  made  into 
the  use  of  smoothing,  MLE,  Bayesian  methods,  or  other  fore¬ 
casting  approaches  in  the  case  of  a  time  dependent  mean  wher. 
demand  is  normally  distributed  but  mean  demand  is  subject  to 
periodic  increases  or  decreases.  In  such  cases  the  esti¬ 
mating  procedure  must  not  only  be  able  to  yield  accurate 
estimates  but  also  be  able  to  detect  the  changes  in  the  mean 
of  the  underlying  demand  distribution  and  adjust  the  fore¬ 
casts  accordingly  [Refs.  3,  7,  and  9). 


APPENDIX  A 


IDENTITIES  USED  IN  MLE  COMPUTATIONS 

As  discussed  in  Section  IV,  one  of  the  disadvantages 
often  attributed  to  maximum  likelihood  estimation  techniques 
applied  to  demand  forecasting  is  the  necessity  of  storing  the 
historical  account  of  demand  data  for  the  entire  period  of 
interest.  The  following  results  show  that  the  above  asser¬ 
tion  is  net  valid  and  that  only  certain  totals  are  required 
to  be  stored  much  the  same  as  with  exponential  smoothing. 

A.  CONSTANT  MEAN  MODEL 
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which  shows 


n  n  2 

that  1  y.  and  Y  y.  are  the  only  data  that  must 
i=l  1  i=l  1 


be  retained  from  period  to  period. 


B.  LINEAR  MODEL  -  Y-INTERCEPT  =  0 


b  = 


n 

l  x<Yi 
i-1 


i 


n 


I 

i=l  1 


y  •  x 


n 


l  (Yi  -  bx.) 
i=l  1  1 

ir~~i — 


2 


^  p  n  ^  /v  n  a  n  n  « 

but  l  (y  -  bx.)  =  l  Yl  -  2b  l  x  y  +  b^  l  xf 
i=l  1  i=l  1  i=]  11  i=l  1 


n  2  n 


=  y  y  .  -  b  V  x .  y . 

.  *•  n  J  i  .  “  ,  1J1 

i=l  i=l 


so  that 


S 


y  »x 


n  „  n 

£  y  .  -  b  £  x  .  y  . 
i=l  1  i=l  11 

n  -  1 


and  the  only  additional  information  which  must  be  stored  is 


\ 

C.  GENERAL  LINEAR  CASE 


n 


b  = 


l  x.y.  -  nxy 
i=l  1  1 


n 


Jx-5 ' 


-2 

nx 


N 

D 


a  =  y  -  xb 


n 


•  x  “l/J1(yi  "  a  "  bXi} 
n  -  2 


Now  omitting  the  limits  on  the  sums  for  simplicity 


n 


Jx(yi  "  3  "  bXi)2  =  ^yi  "  2a^Yi  "  2bIXiyi 


a  o  a  a  a  n 

+  bz£  xf  +  2ab£  x^  +  n  a 


which  becomes  after  substituting  for  a  and  combining  terms 
£y2  -  n  y2  -  2b(Jxiyi  -  n  xy)  +  b2(£x2  -  n  x2)  . 


But  the  quantities  in  parenthesis  are  just  N  and  D  respectively 
so  noting  that 


£2 


D  =  b  •  N 


it  follows  that 


n  AAA  ^  0  o 

I  (y.  -  a  -  bx.)2  =  y ■  -  n  y  - 
i=l  1  1  i=l  1 


bN 


and 


y  »x 


n 


0  A 
l  Yi  ”  n  y2  -  bN 
i=l  1 

- - 
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\ 

Note  that  D  depends  only  upon  the  number  of  periods  n  and 
if  the  demand  is  being  followed  over  successive  periods  of 
time  the  x^*s  are  just  the  first  n  integers  so  that  the  value 
of  D  can  be  stored  for  each  period  of  time  only  and  not 
calculated  for  each  particular  item  of  supply. 
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